AN EXTENSION OF THE FUNCTIONAL ITO FORMULA UNDER A FAMILY 

OF NON DOMINATED MEASURES 
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Abstract. Motivated by questions arising in financial mathematics, Dupire [12] introduced a notion 
of smoothness for functionals of paths (different from the usual Frechet-Gateaux derivatives) and 
arrived at a generalization of Ito's formula applicable to functionals which have a pathwise continuous 
dependence on the trajectories of the underlying process. We study nonlinear functionals which do 
not have such pathwise continuity and further work simultaneously under the family of continuous 
semimartingale measures on path-space. We do this without introducing a second component, as 
carried out by Cont-Fournie [9] [8] but by using old work of Bichteler [4] which allows to keep a 
■ pathwise picture even for complex functionals. 
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1. Introduction 
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It is an understatement to say that Ito's stochastic calculus is a useful tool for the modeling of 
■ systems that evolve under the influence of randomness. An essential part of this calculus is Ito's 
formula which, given a semimartingale X = (Xt) t>Q and a sufficiently smooth function /, shows that 
the stochastic process / (t, Xt) is also a semimartingale and moreover, provides an explicit (Bichteler- 
p l Dellacherie) decomposition of the process / (t,Xf) into a sum of a stochastic integral against X and 
a process of finite variation. Of course, the class of processes of the form / (t, Xt) is only a small 
subset of the stochastic processes that are adapted to the filtration generated by X, a (X), and often 
it is necessary to derive similar statements for this larger class of processes. One area where such 
questions arise is financial engineering: traders aim to understand the dynamics of the price process 
of a (highly path-dependent) contingent claim with respect to the underlying asset modeled by the 
stochastic process X. Motivated by these questions, Dupire |12j showed that if A" is a standard real- 
^-j. 1 valued Brownian motion, it is possible to extend Ito's formula to a nontrivial subset in the class of 
real-valued processes F = (Ft) t>0 which are adapted to a (X) and gave the formula 

t / -L \ rt 



(1.1) F t -F = J [d F r + -AF r )dr+ / VF r dX 



o 



where BqF^F and AF are again a (X)-adapted processes to which we refer as the causal derivatives 
of F (following M. Fliesa^; in [121 El 13 these are called functional time and space derivatives or also 
horizontal and vertical derivatives). For the special case Ft = f (t, Xt), these causal derivatives coincide 
with the usual time and space derivatives of / and the above reduces to the standard Ito-formula applied 
to f (t,Xf). However, the class of processes considered by Dupire is limited to those that depend 
continuousljH on the trajectories of X (e.g. F = J'f (r, X r ) dr but not F_ = f f (r, X r , [X] r ) dr etc.). 
While such a path-by-path continuity is of course guaranteed by differentiability in the case of the 
classic Ito-formula, Ft = f(t,Xt), it is not implied for general, a (X)-adapted processes F by causal 
differentiability; indeed, such a pathwise continuity would be a very strong restriction in the class of 
a (X)-adapted processes. This restriction was subsequently addressed by Cont-Fournie who consider 
general semimartingales X and add a second process (e.g. [X] or its weak derivative; cf. O [8] and the 
excellent thesis of D. Fournie) to express F as a functional which depends pathwise continuously on the 
trajectories of X and this second process which allows them to arrive at a generalization of Dupire's 
functional Ito-formula to a larger class of a (A")-adapted processes. This article is inspired by all these 
strong and beautiful results and shares the same goal of extending the class of functionals of X for 
which (jl.ip holds, however, we explore a different approach. We do not introduce a second process in 



Key words and phrases. Semimartingale decomposition, pathwise stochastic Integration, functional Ito formula. 

^We use the term causal since essentially the same operators can be found in the literature on nonlinear system control 
(i.e. deterministic, bounded variation paths), cf. [181 Section lib], where they are known as causal derivatives. 

2 in (not precisely but) essentially the uniform topology, cf. Example [13] and [9]; in fact, much of this article is concerned 
with avoiding the use of this metric. 
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addition to the continuous semimartingale X but nevertheless arrive at a generalisation of (jl.ip to a 
large class of functionals which includes adapted processes with a complex dependence on the past like 
the quadratic variation of X, stochastic integrals, Doleans-Dade exponentials, compositions thereof, 
etc. To do this we have to overcome some obstacles: 

• firstly, the pathwise nature of the causal derivatives involves operations on sets of paths which 
are null-sets, 

• secondly, many interesting processes are constructed by a probabilistic argument, hence are of- 
ten only uniquely defined as equivalence classes of indistinguishable processes but not pathwise 
unique, 

• thirdly, that many of these processes are not robust under approximations of X in uniform 
norm. 

Especially note the clash of the first point (pathwise considerations matter) and the second point 
(only equivalence classes under a fixed probability measure matter for many applications in stochastic 
analysis). We briefly elaborate on this and sketch our approach: Throughout we work on the canon- 
ical path-space (Q, F°, (J 7 ?)) of Revalued (possibly defective, cf. Section [TT} cadlag processes with 
Xt (u) = uj (t) denoting the coordinate process. In Section [2] we recall Dupire's causal derivatives. 
They are a family of maps R + x f2 — > R which describe the sensitivity of a given F : R + x £1 R 
with respect to perturbations of the coordinate process X at running time. If we denote with _A4* eTm 
the probability measures under which the coordinate process X is a semimartingale with (P-a.s) con- 
tinuous trajectories then we can consider the standard completion (0, J- p , J-f , P) of (fl, J-°,F^ (the 
reason why we work on the space of cadlag paths 0, is the definition of the causal space derivative). 
Now assume we are given an adapted, real-valued process F on (f2, J rP , J-f , P) , P £ Ai s ^ mi then even 
if we have a version of this F at hand which has causal derivatives then it is, as pointed out above, 
not always justified to make strong assumptions on the pathwise regularity of the map (t, u) i— > Ft (u>) 
(already for fixed t; e.g. if F is any version of the stochastic integral, quadratic variation etc.). There- 
fore we introduce in Section [3] the class of functionals C 1 ' 2 , i.e. a subset of the maps R + x — > R, and 
show in Section U] that the functional Ito formula extends to C 1,2 ; Theorem |22 on page 8| reads 



Theorem. Let F : R + x S7 — > R ; F G C 1 ' 2 . Then VP G Ai s c emi , F is a continuous semimartingale on 
(n,F p ,Ff,P) and 



d , d 



F-F = f d F r dr + J2 !' diFr.dXl + \ V /' d tj F r .d [X\X 3 
Jo i=l Jo 2 i,j=i Jo 



a.s. 



Of course, it remains to show that an interesting class of processes can be expressed as a functional 
in C 1 ' 2 : for the class of processes studied by Dupire (Example |13 on page 7[ ) this follows immediately. 
In Section [5] we do exactly this for the stochastic integral J Y^dX and the quadratic variation process 
[JPjX 7 ] using the pathwise Ito-integral due to Bichteler and Karandikar, for example we show in 
Section [5] 

Proposition. For every i,j 6 {1, . . . , d} there exists a map B*' : R+ x n -»• R such that G C 1 ' 2 and 
B % i is on every (Q, J-" p , J-f, P) , P G A^f™, an adapted process, indistinguishable from the quadratic 
variation process (constructed under ¥). 

Note that this is a non-trivial statement linking the map /functional B %3 with the quadratic variation 
process [X^X- 7 ], the latter being only uniquely defined as an equivalence class of indistinguishable 
processes on (fl, F ¥ , J-f, P) for every fixed P G M. s ^ mi . The difficulty is to choose a representant in 
this class which is an element of C 1,2 and additionaly also a representant for the quadratic variation 
constructed under any other measure in M^. emi , i.e. to find an aggregatoiH which is also differentiable 
- if we would restrict ourselves to a subset V C _Mj! erm which is dominated by a single measure, 
i.e. all elements in V are absolutely continuous wrt to this measure, then at least the existence of the 
aggregator would be trivial though not necessarily its differentiability. A similar result holds for the 
Ito-integral f Y- ■ dX and as a consequence of the results in Section [5] we get 



''Given a family V of probability measures and a family of processes {_F P : P G V, F r is a meas. process on (O, , J-f ) } 
we call F : R+ x Q ->• R an aggregator of this family if F = F T P-a.s. VP G V (see [35 for a slightly different definition). 
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Proposition. Let [/,, a : M + xS7 — > R be sufficiently regular (as in Theorem\3Qresp. Example\4'J\). Then 
there exists a map I : R + x Q -> R such that I G C 1 ' 2 and I is on every (Q, J-" p , J-f, P) ,P G .M* 6 "™, an 
adapted process, indistinguishable from the process ( constructed under P ) 

(1.2) / fi r dr + / a r -dX r . 

Jo Jo 

Moreover, d F = fj,, {diF) i=l 4 = (<J-) i=1> d and (dijF) iJ=14 = P-a.s. 

If we denote with V ac the subset of M s <? m% under which t i— > [X] t is absolutely continuous wrt to 
Lebesgue measure then we can sum up the above in the language of quasi-sure analysis 155] : F = F 
■p ac -quasi-sure for some F G C 1,2 iff F is of the form (jl,2p "P ac -quasi-surd3 (under suitable assumptions 
on the coefficients /i,cr); further, the class C 1 ' 2 contains aggregators with complex pathdependence. 
We emphasize that we do not extend the causal derivatives by closure of operators on equivalence 
classes of indistinguishable processes to cover Ito-processes which leads to strong results under one 
fixed measure P but develop a pathwise picture applicable simultaneously to all P G M s ^ m% . The 
approach to consider a stochastic calculus under a family of non-dominated probability measures is 
motivated by recent developments in probability theory like Denis' and Martini's uncertain volatility 
model and quasi-sure analysis via aggregation |33| [TT] as well as Peng's G-expectation [29, 31J. Of 
course, plugging in above functionals B and / into the functional Ito formula leads to trivial identities 
but the point is that compositions with smooth functions or functionals are again in C 1 ' 2 (e.g. see 
the examples in Section [5] or [20j for more applications in finance). Further, the only requirement 
for a functional F : M+ X f2 — > R for being in C 1,2 is besides its causal differentiability (and some 
minor regularity assumptions) the convergence of F (and its derivatives) under finite-dimensional 
approximations, F (oj n ) — > n F (ui) uniformly on compacts in P-probability, VP G M s c emi , where (oj n ) n 
is a piecewise constant approximation to cj, which — in view of the usual approximation results in 
Ito-calculus — seems to be quite a weak and natural assumption (recall classic approximations in 
Ito-calculus for say the stochastic integral or quadratic variation which more or less by construction 
hold in probability or even Wong-Zakai type^ results for highly complex, pathdependent functionals 
are in principle included). 

1.1. Notation. Denote with £ an isolated point added to R rf which plays the role of a cemetery and 
denote with the set [0, oo). We work on the canonical path-space for sub-stochastic (i.e. possibly 
defective) cadlag processes denoted with Q, i.e. 17 is the set of paths oj : P + — > M. d U {£} with lifetime 

C(w) = inf{t > : u(t) = C} 
which are cadlag and stay at the cemetery £ after their lifetime^ £ (u). Denote the coordinate process 

X t (u) := ui (t) 

as well as 

X t - (w) := limX s (w) , A t X (u) := X t (u) - X t ^ (w) . 

Further, we introduce a a-field and filtration on J7, 

F° :=a(X s ,0< s) and J? := a (X s ,0 < s < t) . 

Note that £1 can be endowed with the Skorohod topology in which case the Borel cr-algebra equals 
F° and with this topology Q, is a Polish space (cf. |24l Chapter VI, Theorem 1.4] or [5J Chapter 0]; 
however we are not making use of this Polish structure in this article). To speak about predictable 
processes we have to define Tq- and Xq- which we simply define as = {0,17}, Xo_ = Cj further 
denote the Borel a— field of M + with ,6r + and the Lebesgue measure on M + with A. As usual we call 
any collection of random variables on Q indexed by time t G M + a stochastic process and a stochastic 
process (Ft) is said to be measurable if (t, u) i— > Ft{uS) is measurable on M + x $7 with respect to 

4 An event holds P-quasi-sure if it holds P-a.s. VP £ V where V is a subset of M a c emi (cf. [TTlUg] ^ 

^For obvious reasons, Wong-Zakai results are usually formlated for the convergence along piecewise linear paths (not 

piecewise constant) hence giving a Stratonovich- (not an Ito-) calculus. 

^With abuse of notation we use £ for the isolated point as well as the map £ : Q. — t [0, oo] . Since {lu : £ (w) < i} = 

|J r<i rg Q {uj : X r (lj) — f } 6 it follows that Q (.) is a random variable. In fact the cemetery £ is not essential for our 

arguments but it allows for intuitive characterizations of predictable processes (Proposition [TJ which is useful for some 

proofs. 
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<Sr + x F°; similarly (Ft) is progressively measurable if for each t G K_|_ the map [0, t] x £1 — > R, 
(s,w) I—)- i 7 , (cj) is (iBrg^i x J - ^ ) -measurable and a process (Ft) is said to be adapted to (J~t) if for each 
t G M + . Ft is J^-measurable. The optional cr-field O on R + x Q is generated by the real- valued cadlag 
processes adapted to (•?■?) t >.Q an d the predictable cr-field on M + x Q is generated by the real- valued 
processes adapted tcQ (J-^_) t>0 with cag (left-continuous) paths on (0, oo), cf. [TO, pl21-IV] for further 

properties of these cr-algebras. Given (t, oo) G x and r G R d U {£}, we denote co t,r G the cadlag 
path which coincides until time t with oo but has a jump at time t in direction r G M d U {£} and stays 
constant after time t, viz. 

uj t ' r (s) = oo (s A t) + rl s > t , s£l + 

(with the convention a + £ = £ for any a G M d U {C}); further denote with a; A ( the path oo stopped at 
time t, viz. 

^At (s) = oo (s A t) , s G M+. 
These two perturbations of the path oo play a central role in this article and lead to the definition of a 
causal time and space derivative of a process in Section [2] However, at this point we only recall that 
such perturbations arise naturally when working on the canonical path-space (Q,F°, {J~t)) and lead 
to an intuitive characterization of predictable and optional processes. 

Proposition 1 (Dellacherie-Meyer (TOj pl47-IV]). 

(1) The predictable cr-field V on M + x 17 is generated by the two maps 

(t,oo) i-> t and (t,oo) H> 
and the optional cr-field 0° on K + x Q is generated by the two maps 

(t, to) i y t and (t, oo) i y to At - 

(2) A measurable process F is predictable (wrt to V°) iff 

F t (to) = F t (to 1 A V (t, w)eK + x(! 



and a measurable process F is optional (wrt to O ) iff 

F t (oo) = F t (co M ) V (t, w) G P+ x n. 

We say that a sequence (it (n)) n of partitions of M + (i.e. ir (n) = {t^} k>0 where G R+, i£ < 
and ig = 0) converges to the identity if 

sup tj? — iJjLi — >■ and supt^ — > +oo as n — > oo. 
fc>o fc>o 

Denote for a given t G M+ with i 71 ^™) resp. t 7r ( n ) the nearest elements of the partition ir (n) to the 
right resp. left of t with the convention t G [t^-( n ), t 71 ^™ - )) ; similarly we use 7r ( n )t resp. n / n \t with the 

convention t G ^( n )*) 7r ^^ • Further define for to G the path a; 7r ( n ) G as the piecewise constant 
approximation^) of oo along ir (n) 

(1.3) w'W (t) = U (0) + M**)-«(<2-i))i*»j- 

Given a probability measure P denote with (f2, J 71 ", .Ff , P) the usual augmentation (i.e. right-continuous 
and complete) of (O, F°, Ft, P) . Denote with _A4* erm the set of probability measures P under which 
the coordinate process X is a semimartingale with P-a.s. continuous trajectories and similarly with 
M c resp. A4 l °° the space of measures under which X is a martingale resp. locale martingale with 
P-a.s. trajectories. Above objects and terminology (measurable, the optional and predictable cr-fields, 
etc.) can be defined with respect to the completed cr-algebra F ¥ resp. (Ff) instead of F° and (J~t) , 
e.g. giving rise to the predictable cr-field V v , the optional cr-field ¥ (instead of O ,V°) etc. Recall 
that two processes F and G are said to be indistinguishable/a version of each other on (fl, F v ,Ff ,¥) 
if ¥ (Ft = Gf Vt G M+) = 1; since parts of this article require us to argue pathwise (i.e. with a fixed 



this is equivalent except at t — to being adapted to (Ft) t>Q ■ To quote Dellacherie-Meyer |10l pl21-IV], "in all 
considerations on the predictable cr-field, time plays the devil's role". 
8 With the convention <" + r = £ for r £ R d U {Q. 
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version rather than modulo indistinguishability) we decided to be rather pedantic about spelling out 
quantifiers for all statements. 

2. Causal functionals and causal derivatives 

Definition 2. We say that a map F : R + x £1 — > R is a causal functional if 

F t (oj m ) = F t (oj) V (t, w) G R+ x 0. 

Remark 3. We do not require measurability of causal functionals with respect to the uncompleted 
CT-algebra (B^ >0 x J 70 ); in other words not every causal functional F is a process on (Q,J-°,F^) but if 
F is causal and (<6r >0 x J 70 ) -measurable then F is an £?°-optional process on the canonical path-space 
(n,T°,jf) (by Proposition [D see also Remark I14p . Of course, every O -optional process is a causal 
functional. 

Following Dupire |12| (see also earlier work of Fliess |18| ITT] in the context of bounded variation 
paths from which took the term causal), we define the time and space derivatives of causal functionals. 

Definition 4. Let F : R + x ft — > R be a causal functional. If V (t, oj) G R+ x fi, t < £ (w) the map 

R d B r ^ F t (oj l ' r ) G R 

is continuously differentiable at r = then we denote with V-Ff (w) = (c?i-Ft (w) , . . . , d^Ft (a;)) its 
Jacobian and set VF t (oj) = for (t, oj) with t > ( (oj). We then say that F has VF : R + x — >■ R d as 
causal space derivative. Similarly, we define the matrix-valued second causal space derivative AF = 
{didjF)ij—l as the Hessian. 

Definition 5. Let F : R + x SI — > R be a causal functional. If V (t, oj) G R + x $7, f < £ (oj) the map 

F t+r (w At ) G R 

is continuous and has a right-derivative at r = then we denote with doFf (cj) this right derivative 
and set d^Ft (oj) = if t > £ (w). If additionally r i— )• 9o-F r (w) is Riemann integrable then we say that 
F has the map d$F : R + x Q, — > R as causal time derivative. 

Remark 6. These derivatives can be seen as natural extension of the usual time and space derivatives 
of functions, e.g. if F t (u) = f (t,X(t,u)) and / G C 1 ' 1 (R+ x R,R) then d F t (oj) = f t (t,X(t,u)) 
and VF t (oj) = ^ (t,X(t,oj)) (if t < £ (a;)). Further, it is easy to verify the analogues of the standard 
rules of differentiation, i.e. chain and product rules, etc. 

Remark 7. If oj and w coincide on [0,t] then V-F r (u>) = V-F r (u)) Vr < t and the same statement is true 
for AF and 8qF, i.e. the causal derivatives of a causal functional are again causal functionals. 

Remark 8. These causal derivatives are defined pathwise, hence do not respect null-sets on (Q, F v , J-f , P) 
for a given P G j\4 s c erni (unlike the Frechet derivative in directions of the Cameron-Martin space if P 
is Gaussian, as used by Bismut, Malliavin et al.). It may happen that two processes F and G are 
indistinguishable on (Q, F^ , J-f ,P), both have causal time and space derivatives but apriori there is 
no reason why these causal derivatives should actually be indistinguishable from each othei^. To sum 
up, versions matter and we need an additional regularity assumption. 

Note that throughout this section we avoided the language of probability /measure theory and above 
remark shows how things can go wrong — in fact above remark might disturb our reader because it 
implies that we have to drop the elegant, usual approach of stochastic analysis to regard two processes 
as equal if they are indistinguishable under the measure P. Instead we work pathwise. We hope 
to reconcile her in Section H] where it turns out that under an additional assumption on F (namely 
regularity as introduced in the section below and causal differentiability) the examples in Remark [8] 
are to a large extent excluded (Proposition [29]). In the mean time we simply ask for her patience and 
mention that situations in stochastic analysis in which indistinguishability is not a sufficient criteria are 
not too uncommon: prominent examples include Clark's robustness problem in nonlinear filtering [7], 
quasi-sure analysis via aggregation [33] , pathwise expansions [6] or the recent interest in pathwise 
delta-hedging arguments where in all these cases the existence of a "good pathwise version" is important. 

9 e.g. let F t (oj) = 0, F t (oj) = Ia^xm^o an d Ft (w) = cA t X (to) for some c £ R. Then F,F and F are indistinguishable 
on (fl, J- 9 , J-f , P) VP £ M° em ' , F has causal time and space derivatives (all equal 0), however F does not even have a 
causal space derivative and F has a space derivative equal to the arbitrary chosen c! 
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3. REGULAR AND DIFFERENTIABLE CAUSAL FUNCTION ALS 

We have to cope with the problem that functionals on an infinite-dimensional (path-)space can be 
causal differentiable but not continuous with respect to uniform topology. A simple solution is to 
additionally require such a continuity however, since we are interested in functionals of unbounded 
variation paths this is indeed a very strong assumption. This section introduces a class of functionals, 
regular enough to prove a functional Ito formula but still rich enough to include aggregators of the 
stochastic integral, quadratic variation, Doleans-Dade exponential etc. 

3.1. Regular, causal functionals. The proof of the functional Ito formula requires to understand 
the behaviour of causal functionals under pathwise approximations of the coordinate process X. We 
briefly recall that pathwise approximations of functionals of unbounded variation paths are more subtle 
than the case of bounded variation paths, already under a fixed measure P S A4]l emt '. 

Example 9. Fix P € M s c emi and consider the stochastic process on (Q, J-f,F\ defined as 

F t {u)) = (J x^xA^)- (I'l^jn 

(any version of the stochastic integral on the right hand side), i.e. Levy's area process. To the best of our 
knowledge, the first explicit example of an approximation of the process X which leads to a correction 
term is due to McShane |27| for the case when X is a Brownian motion on (CI, J-" p , J-f, P) , that is for an 
arbitrary c£R there exists for P-a.e. u) a sequence (w n ) n C f2 s.t. \X (io n ) — X (oS)\y Q # ^ — > n but the 
process F (oo n ) converges to the process (Ft + c.t) t P-a.s. — we do not spell out the details (but refer 
to the excellent presentation in 123"! Chapter VI] in the context of Wong-Zakai approximations) 
though we briefly recall the main idea: consider the dyadic partitions and interpolate u) between two 
points not linear, but by choosing between two interpolating functions to construct oj n . The key is to 
make this choice dependent on the sign of (an approximation of) the Levy area surpassed in this time 
intervals. This highly-oscillatory perturbation is not canceled in the limit as the mesh size tends to 
and picked up in an additive correction term of the iterated integrals of X. Also note that such a 
behaviour is not a question of Fisk-Stratonovich vs. Ito integration (for the Levy-area F it does not 
matter which integration is used since the quadratic variation brackets cancel). Sussman (cf. [35, 22j) 
even shows that for arbitrary N £ N one can find approximations such that the first N iterated 
(Stratonovich) integrals converge to the iterated integrals of X but lead to a correction term on the 
(N + l)-th level of iterated integrals. 

A classic strategy to cope with such phenomena is to restrict attention to a class of "natural" finite- 
dimensional approximations, typically piecewise constant or piecewise linear approximations (depend- 
ing on an Ito or a Stratonovich approach), e.g. to prove Wong-Zakai theorems [37 \ I38 | l23 ] Chapter VI], 
large deviation results a la Varadhan, Freidlin-Wentzell, Azencott |36} |2~T] [2], or Stroock-Varadhan 
type support theorems |34] etc. In view of an Ito-calculus and that the causal derivatives are defined 
by piecewise constant perturbations resp. perturbations by jumps, it is not surprising that for us the 
"natural class" of approximations are the piecewise constant approximations of the coordinate process 
X (w) = uj (for the definition below recall that uj n is the piecwise constant approximation of uj along 
7r; as in (|1.3|) ). 

Definition 10. We say a causal functional F : M + x 0, — > IR is a weakly regular, cadlag functional 
if for every P G j^semi faere exists a sequence of partitions tt = (it (n)) , converging to identity such 
that 

(1) F is an (Jf) -adapted cadlag process or0 (Q,F ¥ ,Ff,F), 

(2) the (J-f) -adapted, measurable process (t,u) ^ F? (uj) := F t (u n ^) has left limits (for every 
n P-a.s.) and F™ — > F_ asn-^oo uniformly on compacts in P-probability (henceforth ucrQ)- 

If point (2) holds for every sequence of partitions it = (tt (n)) converging to identity and every 
P € Ai^, emi then we simply say that F is a regular, cadlag functional. Similarly, we say F is a 



McShane's and Sussman's approximations are not adapted but, like the usual piecewise linear approximation, they 
can be simply shifted one interval back in time to make them adapted. 
11 i.e. t F t is P-a.s. cadlag 

12 That is, VT e R+ we have sup rg[0 T] |F, n _ - F r -\ -^ n ~>oc in probability on (fi, J rP , J? , P) . 
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regular caglad (resp. continuous) functional if in point (1) we replace the word cadlag by caglad 
(resp. continuous). 

Remark 11. Instead of point (2) we could simply demand that is a cadlag (or caglad) process 
and F n — > n F ucp. This would already include a large class of functionals, however, the above, more 
general formulation allows to include functionals F which only have left limits along the approximation 
F n . 

Remark 12. It is easy to write down a functional which is only weakly regular but not regular. However, 
Section [5] shows that even processes with complex path-dependence have a "canonical version" which 
is regular, i.e. a regular aggregator which is also causally differentiable. 

The important part of Definition 1101 is point (2) since point (1) is by Proposition [1] already satisfied 
whenever the causal functional is a measurable process with cadlag trajectories on the canonical prob- 
ability space (Q,F°,Ff). An interesting subclass of functionals which are regular are the ones treated 
by Dupire |12j . 



Example 13. Let F : M+ x O -> R be an O -optional process on (O, F°, Ff) and as a map uniformly 
continuous on compacts of R + with respect to the pseudo-metric on M + x f2\{w : ( (uj) < oo} 

dDupire ((t, U>) , (t, Sj)) = \t — t\ + \uj At - ^ At\oo;[0,tVt] - 

This is the class of processes studied by Dupire [12] . Note that O C O p and the continuity with 
respect to dDupire (•; •) guarantees that F is causal and also the convergence of Ft (u; 71 "'™)) as required 
by Definition 1101 (even pathwise!). Hence, every such F is a causal, regular continuous functional. While 
this is quite a small class of processes in view of the usual processes of interest in stochastic analysis, 
it already allows for interesting examples (especially in mathematical finance where the process F can 
model the price evolution of a path-dependent option). 

Remark 14. In Definition [10] of a regular functional we do not require F to be a measurable process 
on (fl,F°,Ft) (by Proposition [1] only measurability of F is needed to make a causal functional an 
O -optional process) but only measurability on the completed probability spaces (O, J- F , (j^f ) , P) . 
This is a minor technical point but this generality turns out to be useful when dealing with causal 
functionals with very complex pathdependence where measurability with respect to the uncompleted 
<7-algebra F° resp. filtration (F®) can be hard to establish (e.g. in the pathwise Bichteler integral 
where "stopping/hitting times" appear etc., see Section [5]). 

3.2. The class C 1,2 of causal functionals. In this section, we introduce the class C 1,2 of causal 
functionals which are regular and have regular, causal derivatives. Before we do this we need another 
definition which will be only needed for the causal derivatives of a functional. 

3.3. Causal continuity in time and space. 

Definition 15. We say a causal functional F : R+ x f2 — > R is causally continuous in space, if Vi? > 0, 
Vt > there exists a function p t: R : M+ — > R+ which is non-decreasing and lim^o Pt,R {%) = such 
that 



sup 

re[0,t] 



F,.K^)-«(^)h«(h- A L ;M ) 



holds Vcj, A,Aefi with \uj\ r fl , | A 



oo,[0,t] ' l"loo,[0,t] 



A 



< R. 

oo,[Q,t] 



Similarly, we give the corresponding definition in the time variable. 

Definition 16. We say a causal functional F : R + x 0, — > R is causally continuous in time, if \/R > 0, 
Vi > there exists a function p tt R : R+ — > R+ which is non-decreasing and lim^o pt,R (x) = such 
that 

sup \F r+Ar (u) Ar ) - F r (w)| = p t ,R |A| [(Hl + sup \ui r - w r _| 

r€[0,t] \ r£[0,t] / 

holds G 0, VA € C([0,t] ,R+) with M^.^ , lA^.^ < R. 
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Remark 17. To motivate above definitions note that they are immediately fulfilled for Ft (oj) = 
f (t,Xt(uj)) whenever / G C([0, T] xR",l) (since / is uniformly continuous on compacts, hence 
has a modulus). Intuitively the continuity in space means that F does react to jumps in the underly- 
ing in proportion to the jump size and the continuity in time implies that jumps of F are only due to 
jumps in the underlying (not due to progression of time alone). 

Remark 18. If F is continuous with respect to drjupire ( Example LT3|) then F is causally continuous in 
time and space and the latter are much weaker requirements for functionals than pathwise continuity 
in the sense of Example [13] since we only require continuity with respect to jumps resp. stopping 
the path at running time, not continuity with respect to perturbations of the whole past of the path 
(e.g. causal continuity in time and space hold for the pathwise version of the stochastic integral or 
quadratic variation in Section [S]). 

3.4. The class C 1,2 . We now introduce the class of functionals we are interested in. 

Definition 19. We denote with C 1 ' 2 the set of all causal functionals F : M + x SI — > R which fulfill 

(1) F is a regular continuous functional which has one causal time and two causal space derivatives, 

(2) 8qF ,V F ,AF are regular caglad or regular cadlag functionals, 

(3) VF, AF are causally continuous in time and space. 

Remark 20. For functions f(t,Xf) continuity is guaranteed by differentiability, however for causal 
functionals the questions of causal differentiability and continuity wrt to the underlying have to be 
treated separately. This is not surprising since we work with nonlinear functionals on an infinite 
dimensional (path-)space. 

Remark 21. We immediately see that {(t,oj) H- / (t,u(t)) : f G C 1 ' 2 (R + x M d ,P)} C C 1,2 , i.e. as our 
notation reveals, we think of C 1 ' 2 as the analogue for general a (X)-adapted processes of the class 
of processes f(t,X t ) for / G C 1 ' 2 (R+ xR d ). Recall Example [TJ which shows that differentiable 
processes with pathwise continuous dependence on the underlying are included in C 1 ' 2 . But most 
important for us, the results of Section [5] below show that C 1,2 includes functionals which depend on 
the past of X (oj) = oj in a more complex way like stochastic integrals, quadratic variation, Doleans- 
Dade exponentials, compositions thereof, etc. 

We have arrived at a point comparable to the moment in a basic analysis course where differentiabil- 
ity of functions has been introduced and it remains to show that many basic functions are differentiable 
and to calculate their derivative. In other words, we still have to show that C 1,2 contains aggregators 
of interesting processes (besides the examples covered by Dupire |12| resp. Example PT3|) . This trite 
observation is the motivation for Section [5l However, before that, we show in the section below that 
the functional Ito-formula extends to the class of C 1,2 functionals. 

4. The functional ito formula for C 1,2 -functionals 

We are now in position to prove an extension of the functional Ito-formula to C 1 ' 2 . The basic idea 
of the proof is similar to a standard proof of the usual Ito-formula, which is to carry out a Taylor 
expansion of order 2, and — following Dupire — the role of the usual derivatives is taken by the causal 
time and space derivatives. 

Theorem 22. If F G C 1 ' 2 then F is a continuous semimartingale on fj7, J rIP , J-f,P) VP G Ai^. emt and 

(4.1) F - F = I d F r dr + V /' ^.dX* + ~ V [' %F r _d [X\X^] r P - a.s. 

Jo i=1 Jo i,j=l J ° 

Proof. Fix t > and a sequence (ir (n)) n of partitions it (n) = converging to the identity. For 

brevity write uj n for the piecewise constant cadlag approximation uj 71 ^ of u along 7T (n) and r n , n r 
resp. r n , n r for the nearest element tt (n) to the right resp. left of r (with our usual convention r G [rn: ?" n ) 
and r G ( n r, n r}), further we use the notation X s ± (oo) '■= U) s ,t := w{t) — U) (s), s,t G R+. Throughout 
assume that n is big enough s.t. t < sup fc ti. For every u G f2 with £ (cj) > t write 

(4.2) F t n(uj n )-F (LO n ) = F^K^-FoKo) 



Yj Ft k ( w a^) ~ F q_, K 

k 
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(causality of F implies Ffn (uj n ) = Ffn (w™ t n) and Fq (uj n ) = Fq (w" )) where the sum is taken over 
all k s.t. 7r (n) 3 tt_i , fu < t n and split the terms on the right hand side into 



Ffn [oj 



J /\t\ 



Fi 



Ffn [OJ 



:Sj?(w) 



The intuition is that measures the space increment at time and TP measures the time decay 
on the interval fe_i>£fcj- In step 1 an d 2 below we show that for every P £ .M;! erm we have on 
(p,, J- v , J-f , P) the following convergence in P-probability 



(4.3) 
(4.4) 



Y,Tk ->n-voo I OuF,.(h: 



Further, since F is a regular functional the process (t, u) i— > Ft- {oJ n ) converges ucp to F- and since 
t i — y Ft is P-a.s. continuous (|4.2p converges in probability to Ft — Fq. This together with (j4.3f> and 
()4.4p implies (|4.ip for fixed t. The continuity of i 7 then allows to remove the time dependence of the 
null-set. 

Step 1: EfcSfc Jo VF - dX + hfo Tr [&F-d[X]}. FixwGfi with ( (u) = oo and note that 

. Taylor's theorem applied to the function 



U) 



guarantees the existence of a #J? (a;) 6 [0, 1] s.t. 

i 



k-lj ''k-l'^k 



hi 



Ari j-nLU-tn xn . 



Hence 



(4.5) -Ft« - F t n ( w » ) = ^ ^ (^ tfe _ i ) 



where 

(4.6) r n , fc (a;,A) = 



dijF t n ( (a;" t n 



tJf,«Jf(«)A 



AW VA 6 M d . 



By the causal continuity in space of AF = (dijF) i . =1 d we can estimate (using that 



(w n ) fc **-!'** , [ ^ 



UJt n t n 



< 2\u\ ,\co n \ < 2|w|) 



OijFq ( (w^ t n 



5; 



(4.7) 



< Pf1\w 



( M&^.tg ) < ft,2M ( ) 
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with p t}2 \u\ (0+) = 0. Taking in (|4.5|) yields 



(4.8) 



with R n (uj) 

\Rn (W) 



< 



< 



(W) 

Y2k r n,k (^'^tj? 1 ,tfe) an( i ^ (|4.6p and (|4.7p we arrive at 

tS,ej(«)« t g_ 1)t j; 



CJj-n til 
L k-l' l 'k 



< 5ft, 2 | W | (sup |wtn_ ijt j^ EE (| x fe_i^ M + | x 4-i^ M ) 



Note that YJ fc 
P-a.e. u) we have 



i,j k 

1 2 



X 4 



(W) + ff (w) 



>■„ [-X^L + [X] t m probability and obviously for 



Pt,2\< 



sup 

k 



UJf™ j-n 



>n0 



hence \R n (.)| — > n in probability It remains to show convergence of the first two sums on the rhs in 
(|4,8p . Therefore we rewrite Y\ Vi^™ ( oflm ) -Xfi t n (uj) by adding and subtracting Vi^« ( w^n 

fc y fc__iy fc — l 1 fc fc — 1 y fc— 1 

and see that it is sufficient to prove 

k 



VF_ • dxj (w) 



UJ 



A/" 



For the first sum note that VK 



r fc-i V M k-i 



- Vi*t« i (w n ). For brevity we introduce the processes 

F\ (t,u) (->■ I t n (u) := VF t (u n ) and I t (uj) := VF t (uj) and note that I™ ^ n 7_ in P-probability (since 
V-F is regular by assumption). For P-a.e. uj we have 



K) • ^-!.*S H = (/ T > X r) M = (/ ^rdX^j (UJ) + (| (I- - I nr ) dX^ (uj) 

(recall the notation r £ ( n r, n r\). Note that (/ n r) re R is a piecewise constant caglad process (therefore 
a simple integrand), hence f I nT dX r converges ucp to I-dX = / V F^dX; since J" converges ucp 
to I_ = VF_ we get that J (J" r — 7 n7 .) (iX converges ucp to 0. It remains to estimate 

( 4 - 9 ) E ( Vi ^ Ku) " (u^nj) • X n _ vt n (uj) . 

fc 

Therefore note that (VF t ™ 1 ~~ ^^*fc-i ('Atg )) ^ ' nence above sum can again be seen 

as the stochastic integral of a simple integrand and causal continuity in time implies 



sup 

fc 



Vi Vi (^AtJ^J ^ P«,2|w| (mes/i (vr n ) + sup ^w t »_ 1 - ^»_ 2 ) 



> n for P-a.e. a; 



which is sufficient to conclude that (|4.9p converges to in probability. To sum up, we have shown that 
the sum YJ fc VF t « +i (^m^ ) ' -^Jf-i^E ( a; ) a PP ear i n g in (|4.8p converges ucp to J'^VF-dX . The very 
same arguments imply the convergence 

ft 

Tr [AF_ -d[X]]. 
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Step 2: Y^k^k ~~ *n f()doF r dr. Causal time differentiability of F implies that 



K+ 3 • ^ F t n_ i+ . ^ t »_ i 

is continuous and has a right-derivative oi which is additionally Riemann integrable. This is 

enough for a modification of the fundamental theorem of Riemann integration to apply (see [5]): 

^-i+tM-i) ( W ^-i) " Ft £-i K-J = / ^-i+r ( w Atu_J ^. 

Taking the sum Y2k over an ^ s -^- 71 ( n ) ^ ^fc-i>*fc — gi yes 



£2? = f d F r (u-J dr. 
Jo 



'0 

Since do-F is causal, doF r (u)™ r ) = 8qF t (u: n ) for r it (n) (which is a Lebesgue null-set of [0, i]), we 
have shown 

V£= f* d F r (uj n )dr. 
k Jo 

By assumption the integrand converges ucp on (fi, J-* p , J-f, P) to 8qF, hence we have shown the con- 
vergence 



k J ° 



□ 

Remark 23. Theorem[22]says that F G C 1 ' 2 implies that F is an Ito-process with regular coefficients. In 
Section [5] we show (Theorem [33] and Example 143 P the converse, i.e. every process of the form /j, r dr + 
J*q a r -dX r has (provided /U, a are regular enough) an aggregator in C 1 ' 2 . Theorem 1221 immediately 
gives the explicit Bichteler-Dellacherie-Meyer decomposition of F under every P G M l ° c C Ai^. emi . 

Remark 24. Dupire showed above formula for the case when (t, to) \— > F (t,uj) is pathwise continuous 
in the (pseudo-)metric of Example [13] and P is the Wiener measure. Cont and Fournie [8] give a 
mathematical precise argument and two proofs: a pathwise, analytic proof in [8] and a probabilistic 
proof in [9]. Further, both apply to functionals which depend pathwise continuoulsy on oj and a second 
component. 

Remark 25. Motivated by the case of no pathdependence, i.e. Ft (w) = / (t) one could argue that a natu- 
ral criteria for a fundamental theorem of calculus is absolute continuity instead of right-differentiability 
In fact, alternatively to Definition [5] one could say that F has a causal time derivative if r i— >■ F r (com) 
is absolutely continuous and above proof then applies modulo obvious modifications. 

Remark 26. We restricted attention to continuous semimartingale measures Ai^ emi (i.e. t \— > Xf is 
a.s. continuous) but under extra assumptions on the regularity of F one can also in the case when 
X is a cadlag semimartingale subtract the small jumps before steps 1 and 2 (see also [S]). However, 
the assumptions on F get more technical and in view the applications given in the sections below we 
decided to stick to the case J^[ s ^ mi since our focus lies on the lack of continuity oj i— > F_ (cj). In the 
same spirit one could ask for a Tanaka formula etc. 

The proof of Theorem [22] requires only regularity of F and its derivatives along one sequence of 
partitions converging to identity and with respect to a fixed measure P G A4^. emi , i.e. we have shown 
a more general statement: denote with C w ' the class of functionals which fulfill Definition 1191 when 
we replace regular with the condition that for every P G M^. erni there exists a sequence of partitions 
7r = (ir (n)) converging to identity such that F and its derivatives are weakly-regular along n. 



13is 



%ot only at . = since f( t »_ 1+r)+ . = *(tjf_ 1+ r) + . ( ("Vj A ( t »_ 1+r ) J Vr G R +' henCe 

F t n_ i+r+ . (u^ J - F t n_ i+r (c^J = 9oFt»_ 1+ , (^ t n_J • +o(.) as . \ 0. 
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Corollary 27. If F G then F is a continuous semimartingale on (ft, J-" p , J-f, P) and 

F-F = [' d F r dr + J2 f diF^dXl + \ V /' d {j F r d [X\X>\ F - a.s. 
Jo i=1 Jo i,j=l J ° 

Applied with Ft (w) = / (t, Xt (w)) all the above reduces to the standard Ito formula and motivated 
by this one can ask for generalizations of the Feynman-Kac formula or the classic link between (sub- 
)harmonic functions and (sub-)martingales. Indeed, all this was already done by Dupire [12J and one 
can follow the same arguments the only difference being that we now cover a larger class of aggregators 
- though we still have to convince the reader of this, i.e. that C ,2 includes interesting functionals 
besides the functionals of Example [131 Before that we briefly come back to the non-uniqueness of the 
causal derivatives seen as processes as pointed out in Remark [8l 

4.1. Uniqueness of causal heat operator and space derivative acting on processes. Due 

to the pathwise nature of the causal derivatives, it may happen that two different functionals give 
rise to processes that are indistinguishable, both are differentiable but these derivatives are no longer 
indistinguishable processes. 

Example 28. Let d = 1. In Section[5]we introduce a causal functional, denoted LY] S ^ : P + xO->M, 
s.t. [X] BK G C 1 ' 2 and under every F £ Mf m \ [X] BK is indistinguishable on (ft, J" p , F ¥ , P) from the 
quadratic variation process of X on (ft, J-" p , J- v , P) . Consider the two causal functionals, both are in 
C 1 - 2 , 

(t,uj) H- [X]f K (u) and (t,u) H- t 
which are indistinguishable processes if X is a Brownian motion under P however their causal derivatives 
are distinguishable under P (e.g. d (t) = 1 but d ([X] BK ^j = P-a.s., cf. Corollary ESJ . 

However, the situation is not too bad: in above example the first spatial derivative V and 8q + 
gXV [A • e£ [X]] applied to these functionals are again indistinguishable processes on 
Theorem 1221 gives us the simple explanation that 8qF and \AF both replicate the behaviour of F on 
the time-scale of t as opposed to VF which replicates the behaviour of F on the \/^-scale, hence we 
can only expect uniqueness of the "causal heat operator" d$F + ^Tr [AFd [X]] and VF. 

Proposition 29. Let F,G £ C 1 ' 2 , P G M l ° c and assume [X] = J a 2 dr under F for some continuous, 
J-f -adapted process a. Further assume F and G are indistinguishable on (ft, F v , J-f, P) i.e. 

F=G. P - a.s 

Then 

(4.10) d F + ^Tr [AF ■ a 2 } = d G + ^Tr [AG ■ a 2 } P-a.s. 

and 



(4.11) / VF-dX= / VG-dX P-a.s. 

Jo Jo 

Proof. As in [9], this is a simple consequence of the uniqueness of the semimartingale decomposi- 
tion: from Theorem [22] it follows that Hf (oj) := Ft (cj) — Gt (u)) is a continuous semimartingale on 
(ft, J*, jf,P) but every continuous semimartingale is a special semimartingale, that is the decompo- 
sition H = M + A into a local martingale M and a process A of locally bounded variation is unique up 
to P-indistinguishability which already implies (|4.1ip . Since H is indistinguishable from the process 
this implies 



A. = J' (d H r + X -Tr [AH r ■ a 2 ]^ dr 



is indistinguishable from 0, hence we have that P-a.s. d H t + \Tr [AH t ■ of] = Lebes gue a.e. which 
implies (j4TT0|) . " □ 

Remark 30. Similar path-dependent heat operators appear already in seminal work of Kusuoka-Stroock 
[26j using Malliavin calculus techniques on abstract Wiener spaces. However, no Feynman-Kac formula 
was given and the abstract Wiener space setting is of course essential for the results in [26J. Further, we 
mention the interesting approach of Aim [TJ on semimartingale representations via Frechet derivatives. 
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Remark 31. Under stronger assumptions on P (e.g. when X is a non-degenerate Brownian martingale) 
one can also deduce the indistinguishability of VF and VG since then J VF • dX = j VG • dX already 
implies indistinguishability of V-F and VG. This is essential to define an extension of the causal 
derivatives V and do + \Tr [A.d [X]] which are defined pathwise to operators on (equivalence classes 
of) Ito-processes via closure available under a fixed measure as in Malliavin calculus cf. O Section 5] 
and Remark [3H We note that in classic work of Kusuoka-Stroock |26| a similar phenomenon appears 
where the Frechet derivative D and only the "Malliavin heat operator" A = + \TnD 2 (not only 
or ^TjjD 2 ) are extended by continuity and closure on abstract Wiener spaces (cf. |26| pll, Remark 
1.17ff]). 

5. The ITO-INTEGRAL AND QUADRATIC VARIATION PROCESS AS CAUSAL FUNCTIONALS 

In this section we show that one can construct a pathwise version of the stochastic integral of a cadlag 
process against the coordinate process X which is an element of G 1 ' 2 , i.e. a regular, differentiable causal 
functional and similarly for the quadratic variation. To achieve this we make use of the path-wise Ito- 
integral and quadratic variation as introduced by Bichteler in 1981 using adapted Riemann sums and 
subsequent simplifications by Karandikar [H [25] . 

Example 32. To appreciate Bichteler's pathwise integral let us briefly recall why pathwise construc- 
tions are non-trivial on the simpler example of the quadratic variation process in the one-dimensional 
case (d = 1). Once a probability measure P £ M^ emi and a sequence of partitions (vr (n)) n is fixed, 
standard results guarantee the existence of a process [X] on (Q, "F p , J-f, P) (for brevity let d = 1) such 
that 

, 2 

Xq + r^*fc A * — i Ai [X]t un if orm ly ° n compacts in P-probability. 

Hence, a candidate for a pathwise version (which is also causally differentiable) of the quadratic vari- 
ation process is the map 

J \u (0)| 2 + li m?woo £ | w (q At) — oo A t) | 2 , if this limit exists 

(t,oo) i-> F t (cj) = < 

10 , otherwise. 

Indeed, if P £ M s ^ m% is such that the coordinate process X is a Brownian motion and (jr {n)) n are 
dyadics then above F is a process on (fi, J-" p , J-f, P) indistinguishable from [X]. However, this is not 
true under every P £ _y\^* e ™ or an y sequence of partitions (tt (n)) converging to the identity — in fact, 
^2 k \u) (t^ A t) —oj (tj!_i A t) | 2 converges in general only in P-probability. (Of course, for every given 
P £ j^[sem% one can a i wa y S £ nc i a f as ^ enough vanishing sub-sequence of (vr (n)) n to ensure convergence 
P-a.s.) The same problem appears, afortiori, for the stochastic integral where path-regularity of the 
integrand has to be considered as well, etc. 

The main result of this section is Theorem [33] below which shows that the causal space derivative 
acts as the inverse of a pathwise stochastic integral. 

Theorem 33. Assume Z is a real-valued cadlag process on Kl,J-f, J 70 ) which is cadlag regular. Then 
for every i £ {1, . . . , d} there exists a regular cadlag functional 

j| : R + x!)->R 

such that 

(1) J l z has causal derivatives <9oJ^,VJ^,A and these are again regular functionals, 

(2) J l z is under every P £ J\A s c emi indistinguishable from the Ito-integral J Q Z^dX 1 on (CI, F r ', J-f, P) , 
i.e. 

J z {.,u) = (J Z^dX^j (oo) forP-a.e. oo. 

Moreover V(f,u)el + x!],i<( (oo), 
d J l z {t,oo) = 0, 

X7J z (t,oo) = (0, . . . 0, Zt- (oo) , 0, . . . 0) T (only i th component non-zero), 
AJ z (t,oo) = 
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and if Z- is causally continuous in time then J z G C 1,2 . We also use the notation f Q Z_ (w) d BK X 1 (co) 
for the process J z (oj) (d BK stands for Bichteler-Karandikar) , resp. in the multidimensional case Z = 
(Z*)f =1 we write f Q Z_ ( w ) • d BK X ( w ) for £? =1 4* (<")' 

Remark 34. In [9J Section 5] V is extended under a fixed measure P to an operator Vx acting on square- 
integrable (standard) stochastic Ito-integrals which can be identified via the Clark-Hausmann-Ocone 
Theorem as the predictable projection of the Malliavin derivative. We refer the reader to Section 
5] , but remark that Theorem [33] shows that the causal derivative V acts as the pathwise inverse to 
an aggregator of the Ito-integral, e.g. in the case of a Wiener functional £ G L 2 (Jy) that has the 

representation £ = Ep [£] + Z_ • dX with Z as in Theorem [33] then £ has a representation as 

£° G C 1 ' 2 , given as £^ (w) := Ep + J - ^ (cj) • d BK X (w); more interestingly this is not restricted 

to Wiener functionals provided of course the existence of such a representation £ = m + Z_ • dX. 
The considerable price is that already in the Brownian context one has to assume some regularity 
of Z but martingale representation does not guarantee any such regularity. However it might be 
a reasonable assumption for applications (e.g. implementable Delta-hedging strategies, integrands of 

the form Zt = v (t,Xt, [X]f K ^j etc; note also that using the non-pathwise L 2 -extension Vx leads to 

not completely trivial issues about consistent approximations by the pathwise operator V acting on 
cylinder functions). 

The process J z in Theorem [33] is constructed via the pathwise Bichteler-Karandikar integral which 
we briefly recall. 

Theorem 35 (Bichteler [4j,Karandikar |25^ Theorem 2 and 3]). There exists a map 

I : D(R+,R) x D(R+,R) ->D(R+,R) 
s.t. for any filtered probability space (Ft*j satisfying the usual conditions, any real-valued 

semimartingale X and any adapted, real-valued, cddldg process Z on I Q, P, I T% ) , P ) we have 



l(z(u),X(uj) (.) 



o 



Z-dX (oj) F-a.s. 



Remark 36. The set of full P-measure on which the equality holds depends of course on the processes 
Z and X. 

Remark 37. Another well-known result on pathwise integration is Follmer's Ito-integral "sans proba- 
bilites" |19) . However, this allows only to give a pathwise version when the integrand is of the type 
J* V/ (X s ) ■ dX s , i.e. a non-pathdependent integrand which is in gradient form (the latter is a strong 
restriction in multi-dimensions d > 1, cf. Example [9] for a process which is not of this type). The non- 
pathdependence can be relaxed, see |8j, however the gradient form is essential for Follmer's argument. 
Let us also mention the approach via capacity as in Denis and Martini to construct an integral for 
quasi-sure integrands and Peng's approach via PDEs |28] which lead to a much less explicit integral 
but on the other hand apply to less regular integrands than the Bichteler — Karandikar integral. 

Remark 38. The map I is constructed as follows: for x,z G D (R+,K) and n G N define the sequence 
(rf ) i>0 C K + U {+00} recursively a0 

7? = inf jt > t?_ x : z t - z r n_ i > 2~ n , t G M+| and r n = 0. 
Define I n : D (R + , R) x D (M + , R) -> D (M+, R) as 

(5.1) l n (z, x) (t) = z (0) x (0) + * W) {x (rf +1 At) — x (r™ A t)) 

i>0 

and set / (z, x) = lim n I n (z, x) whenever this limit exists in the topology of uniform convergence on 
compact sets on D (R + ,R), otherwise set I (z,x) = 0. 
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With the convention inf = 00 and t^ +1 = 00 if r, 
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Proof of Theorem VJ'Ji We define 

**(£«;)). ,ift<CM 



(5.2) J l z {tM--- 



.else 



where /(.,.) : D (R+, R) x D (R+, R) -> D (R + , R) is the map constructed in Theorem [35] and Remark 
38l For brevity write J instead of J l z throughout this proof. Point (2) follows immediately from 
Theorem [35] 

To see the causal differentiability, consider (t, uj) G R + x £1 with t < C( w )- I n this case, if the 
adapted Riemann sum (|5,ip of the Bichteler-Karandikar integral / \Z (uj) ,X i (uj)) (t) converges as a 
uniform limit then this is also the case for co t,r or ujm for any r = (r 1 , . . . , r rf ) E R d : first note that Z is 
O -optional (since 0° is generated by the cadlag processes) and so by Proposition [JJ the trajectories of 
Z (uj ) and Z (uj/\t) coincide on [0, t], hence the sequence (r™ (uj)) i>0 is the same as (r™ (^At))j>o when 
restricted to [0, t]. Then directly from the definition of the Bichteler-Karandikar integral as limit of 
JO it follows that 



J(t + e, co M ) = I{Z (uj m ) , X 1 (oj M )) t+e = I (Z (uj m ) , X 1 (co)) f = I (Z (uj) , X 1 (co)) t , 

which implies do J (t,co) = 0. Similarly, we can use that the trajectories of Z (uj t,r ) and Z (co) coincide 
on [0, t) to see that 

J(W> r ) = J(Z(o;^),X i (a;^)) i = /(Z(a;^),X i (a;)) t + Z t _(^)r i 

which shows the claimed first space derivative. By 0°-measurability (resp. causality) of Z 
Z t _ (aM) = hmZ r (aM) = lirnZ, = Z t _ (ufi-**>) 

and similarly Zt- (uj) = Zt- (uo t '~^' tUJ ) which implies AJt (uj) = 0. 

To see that J £ C 1 ' 2 we first show that J is a continuous, regular functional, more precisely 
that VP € ^A s ^ m% the map (t,u>) t— >■ Jt(co n ) is an adapted cadlag process that converges ucp to 
the (P-a.s.) continuous process (t,uj) i-t J t (uj) on (p., J rP , jf,F). Therefore fix P G Mf mi , any 
sequence of partitions (it (n)) n converging to the identity and denote the associated piecewise constant 
approximation of co with (uj n ) n . From the very definition of J we have V (t, uj) £ R+ x $7 

Jf (co) := J t (uj n ) = I (Z (co n ) , X 1 (co n )) (t) . 

Now, 

(t,co) h-> Z? (uj) := Z t (co n ) and (t,u) ^ X t n (uj) := X\ (u n ) 

are cadlag processes on (f2, J-^, J-f, P) and X n is of bounded variation (on compacts) and therefore a 
semimartingale, hence Theorem 1351 guarantees that 

J n (uj) = QT ZW 1 ^ (w) for P - o.e. w. 

so especially J n is a cadlag process. To see that J" — >• J_ ucp note that for every n we have P-a.s. 

/ Z n _dX n = [ (Z™-Z_)dX n + [ Z_dX n 
Jo Jo Jo 

= > ( ^ n — Z— ) , „ Xt n t n ~\~ s Zt" _ Xt n t n 

/ J V — fc-l' *: / / r fc-l T k-l> l k 

fc:t£<- " k - l k<- 

= C n dX+ Y, Z tl ^-X tl _Mi 

J0 fc:tjj<- 

with = X^7r(n) f^t|- ~~ ^[t n A i n a ) (^h e secon d equality holds since X n is of bounded varia- 

tion on compacts, hence the stochastic integrals are indistinguishable from pathwise Lebesgue integrals, 
cf. |32^ Theorem 18]). Since Z is a regular functional, C n — > ucp and the first integral converges to 
ucp as n — > oo. Standard results show ucp convergence of the Riemann sums to J Q Z-dX. □ 

As an immediate but important corollary we get the existence of a C 1,2 version of the quadratic 
variation [X] of X. 
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Corollary 39. For every i,j G {1, . . . ,d} there exists a causal functional 

B ij : R + x n -)• R 

such that 

(1) B*i G C 1 ' 2 , 

(2) Z? 4 -' is indistinguishable from \X l ,X 3 '] on (0,, ,F F , Jf, P) /or every P € .M* 

/,"•' -V'..Y ; F -a.s. 
Moreover V (t, w) eI + xS],f<( (m), i,j G {1, . . . ,d} 

d Bi j (u) = 



and 



VB?(u) = (A t X J '(u;)l fc=i + A t ^(a;)l fc=i ) fc=lj d 
ABj J (w) = (lx=jl fc =i + lz=ilfc=j)fe 7=1 a 



We also use the notation [X*,Xi] B for B %3 . 

Proof. Since X 1 is the coordinate process it fulfills the assumptions for the integrand in Theorem 1551 
except that X % is not real-valued (only P-a.s.). Therefore set Xt (oj) = Xt (oj) lf( w ) =0Q - Hence, Theorem 
[53] allows us to the define 



(5.3) B? (u) = X\ (u) X\ (oj) - / T_ (u) d BK X j (oj) - / X 3 _ (oj) d BK X 1 (oj) 

Jo Jo 

Points (1) & (2) follow from Theorem I35[ Theorem [55] and the identity 

[X\X j ] = X l X j - [ X l dX j - [ X j dX\ 
Jo Jo 

Finally, apply the causal derivatives to B 13 (t,oj) which gives for the time derivative 

d {X i X j ) t (oj) = d Qf X 1 (oj) d BK Xi (u)j = ^ X 3 (oj) d BK X i (u)J = 0. 

and from Theorem [55J the space derivatives of the integrals follows as well 

diB? (oj) = X\ (oj) - X\_ (oj) = A t X 3 (oj) . 



□ 



Remark 40. The first spatial derivative of B %1 is indistinguishable under every continuous semimartin- 
gale measure from but the second spatial derivative is not! This is a direct consequence of the 
pathwise nature of the causal derivatives as pointed out in Remark [HJ (pathwise the first derivative is 
not equal to 0). 

Remark 41. It would be desirable to have a result which guarantees the existence of a C 1,2 -aggregator, 
i.e. given a reasonably nice subset V of J^4^ emi an d a family of processes 

|z p : P EV, Z v is a progr. measurable process on , J^f , P^ X 

which fulfill some consistency condition then there exists a causal functional Z G C 1,2 s.t. Z = Z F P- 
a.s. VP G V . Indeed, the only result we are aware of in this direction is the main result in |33[ Theorem 
5.1] which shows for an important subset V of J\/l^ emi the existence of an aggregator. However, even 
with the restriction to V it is not clear which conditions would guarantee that the aggregator is causally 
differentiable which is why we have choosen a constructive approach for the two examples of stochastic 
integration and quadratic variation which also applies to all elements of Ai^ 1711 . 

We give some applications of Theorem [55J and Corollary [5U] the first is an easy application of the 
chain and product rule. 
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Example 42. Let d = 1 and E t (to) = exp \X t (lo) - \ [X]f K (u)) . Then d E t (w) = 0,VF 4 (w) = 

Ft (u) (1 - A f X (w)) and AE t (u) = E t (u) A t X (u) (-2 + A t X (w)). Hence, Theorem [22] reduces to 
the classic identity 

E. = 1 + / F r dX on (ft,F p , Jf ,p) , P £ 7Wf mi . 



o 



The path-dependent Ito- formula, Theorem l22l shows that every element of C 1 ' 2 is a Ito- process 
A^ 6mi -quasi-sure. Using Theorem [35] we can now go the other direction: 

Example 43. Assume fj,, a : M x — > R fulfill the same assumptions as Z in Theorem 1331 Then the 
Ito-process 

d 



o JO 



is indistinguishable on (ft, J*, J=f,P), P G TWf™, from F G C 1 - 2 defined as 

F t H := f fi r (w) dr + J2 f a r _ (to) d BK X l r (u) 
Jo i=1 Jo 

and V(i,<j) G K+ x Jl,t < CM, 

O F (w) = ^ H , VF 4 (w) = a t _ (w) = (at (w) , . . . , o£_ (w)) T , AF t (w) = 0. 

(hence 8qF = [i, VF = cr_ P-a.s.) which captures the intuition that the causal time derivative 
measures the infinitesimal drift and the causal space derivative measures sensitivity to instantaneous 
changes of the underlying process X, cf. Remark 1231 

Even if we are only interested in a fixed measure P G Ai^. em ' 1 , Theorem 1221 and the results of this 
section are a non-trivial extension of the functional Ito-formula: while not the topic of this article, we 
finish with an application of Corollary 1391 to mathematical finance which allows to compare it with the 
approach of Cont-Fournie [9] [8] who treat the same example (see [20, Section 5] for more applications 
in finance relying on such a generalized functional Ito-formula). 

Example 44. For fixed P G A4^ emi the coordinate process X = [X^ . is interpreted as the dis- 
counted price process of d assets in a security market (for brevity we let d = 1 and assume no interest 
rates). Assume there exists an equivalent martingale measure Q G M l ° c ,Q. ~ P and [X] = f a 2 dr un- 
der Q for some adapted, continuous, bounded process a 2 . The usual Black-Scholes-Merton dynamic 
replication argument translates and shows that if F G C 1 ' 2 and 

d F + yAF = 0, F T = vr P - a.s. 

then Ft is an arbitrage free price for the claim tt at time t G [0, T] (i.e. a version of Eq \tt\ J-f] )o Applied 

to variance swaps we can make the Ansatz 7r = [JC]^" — k and Ft (w) = [X]^P (ui) — k + v (t, Xt (ui)) 
and above reduces to the parabolic SPDE 

dv a? d 2 v 9 /m , . m 

(for a local vol. model a 2 = Cg (t, Xt) X 2 this further reduces to the backward heat equation with source 
term x 2 oq (t,x)). Of course, more interesting applications exist for more complex payoffs (nonlinear 
options on variance, etc.) in combination with the uncertainty P G V for V C < M£ eTm ^ ne re i a t ec [ 
pathdependent PDEs and questions of uniqueness, existence, interplay with the nonlinear expectation 
£ (.) = infp g -pEp [.] etc. touch the tip of an iceberg which is currently under heavy development even 
for the case when tt is a continuous functional in uniform norm of the underlying — we draw attention 
to work of Peng- Wang |30^ I31| and work of Ekren-Keller-Touzi-Zhang |13| [J3J [15] . 



If of = a% s Xf, a% s £ R fix and ty = (Xt — K) + one makes the Ansatz Ft = / (t, Xt) and this reduces to the 
classic Black-Scholes-Merton PDE. The usual Greeks doF,X7F, AF are now functionals and it is interesting to recall 

2 

Proposition 1291 and Remark 1311 which show that as a process, only doF + is uniquely defined, i.e. in a financial 

context it makes sense to think of this non-uniqueness as the Gamma-Theta or convexity-time-decay tradeoff. 
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